Search CORE

13 research outputs found

Pan-cancer classifications of tumor histological images using deep learning

Author: Caruana Dennis
Chuang Jeffrey H.
Farahmand Saman
Foroughi pour Ali
Namburi Sandeep
Noorbakhsh Javad
Rimm David
Soltanieh-Ha Mohammad
Zarringhalam Kourosh
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 04/03/2020
Field of study

Histopathological images are essential for the diagnosis of cancer type and selection of optimal treatment. However, the current clinical process of manual inspection of images is time consuming and prone to intra- and inter-observer variability. Here we show that key aspects of cancer image analysis can be performed by deep convolutional neural networks (CNNs) across a wide spectrum of cancer types. In particular, we implement CNN architectures based on Google Inception v3 transfer learning to analyze 27815 H&E slides from 23 cohorts in The Cancer Genome Atlas in studies of tumor/normal status, cancer subtype, and mutation status. For 19 solid cancer types we are able to classify tumor/normal status of whole slide images with extremely high AUCs (0.995±0.008). We are also able to classify cancer subtypes within 10 tissue types with AUC values well above random expectations (micro-average 0.87±0.1). We then perform a cross-classification analysis of tumor/normal status across tumor types. We find that classifiers trained on one type are often effective in distinguishing tumor from normal in other cancer types, with the relationships among classifiers matching known cancer tissue relationships. For the more challenging problem of mutational status, we are able to classify TP53 mutations in three cancer types with AUCs from 0.65-0.80 using a fully-trained CNN, and with similar cross-classification accuracy across tissues. These studies demonstrate the power of CNNs for not only classifying histopathological images in diverse cancer types, but also for revealing shared biology between tumors. We have made software available at: https://github.com/javadnoorb/HistCNNFirst author draf

Boston University Institutional Repository (OpenBU)

Deep learning features encode interpretable morphologies within histological images.

Author: Chuang Jeffrey H
Foroughi Pour Ali
Park Jonghanne
Sheridan Todd B
White Brian S
Publication venue: The Mouseion at the JAXlibrary
Publication date: 08/06/2022
Field of study

Convolutional neural networks (CNNs) are revolutionizing digital pathology by enabling machine learning-based classification of a variety of phenotypes from hematoxylin and eosin (H&E) whole slide images (WSIs), but the interpretation of CNNs remains difficult. Most studies have considered interpretability in a post hoc fashion, e.g. by presenting example regions with strongly predicted class labels. However, such an approach does not explain the biological features that contribute to correct predictions. To address this problem, here we investigate the interpretability of H&E-derived CNN features (the feature weights in the final layer of a transfer-learning-based architecture). While many studies have incorporated CNN features into predictive models, there has been little empirical study of their properties. We show such features can be construed as abstract morphological genes ( mones ) with strong independent associations to biological phenotypes. Many mones are specific to individual cancer types, while others are found in multiple cancers especially from related tissue types. We also observe that mone-mone correlations are strong and robustly preserved across related cancers. Importantly, linear mone-based classifiers can very accurately separate 38 distinct classes (19 tumor types and their adjacent normals, AUC = [Formula: see text] for each class prediction), and linear classifiers are also highly effective for universal tumor detection (AUC = [Formula: see text]). This linearity provides evidence that individual mones or correlated mone clusters may be associated with interpretable histopathological features or other patient characteristics. In particular, the statistical similarity of mones to gene expression values allows integrative mone analysis via expression-based bioinformatics approaches. We observe strong correlations between individual mones and individual gene expression values, notably mones associated with collagen gene expression in ovarian cancer. Mone-expression comparisons also indicate that immunoglobulin expression can be identified using mones in colon adenocarcinoma and that immune activity can be identified across multiple cancer types, and we verify these findings by expert histopathological review. Our work demonstrates that mones provide a morphological H&E decomposition that can be effectively associated with diverse phenotypes, analogous to the interpretability of transcription via gene expression values. Our work also demonstrates mones can be interpreted without using a classifier as a proxy

The Jackson Laboratory: The Mouseion at the JAXlibrary

PubMed Central

Deep learning-based cross-classifications reveal conserved spatial behaviors within tumor histological images.

Author: Caruana Dennis
Chuang Jeffrey
Farahmand Saman
Foroughi Pour Ali
Namburi Sandeep
Noorbakhsh Javad
Rimm David
Soltanieh-Ha Mohammad
Zarringhalam Kourosh
Publication venue: The Mouseion at the JAXlibrary
Publication date: 11/12/2020
Field of study

Histopathological images are a rich but incompletely explored data type for studying cancer. Manual inspection is time consuming, making it challenging to use for image data mining. Here we show that convolutional neural networks (CNNs) can be systematically applied across cancer types, enabling comparisons to reveal shared spatial behaviors. We develop CNN architectures to analyze 27,815 hematoxylin and eosin scanned images from The Cancer Genome Atlas for tumor/normal, cancer subtype, and mutation classification. Our CNNs are able to classify TCGA pathologist-annotated tumor/normal status of whole slide images (WSIs) in 19 cancer types with consistently high AUCs (0.995 ± 0.008), as well as subtypes with lower but significant accuracy (AUC 0.87 ± 0.1). Remarkably, tumor/normal CNNs trained on one tissue are effective in others (AUC 0.88 ± 0.11), with classifier relationships also recapitulating known adenocarcinoma, carcinoma, and developmental biology. Moreover, classifier comparisons reveal intra-slide spatial similarities, with an average tile-level correlation of 0.45 ± 0.16 between classifier pairs. Breast cancers, bladder cancers, and uterine cancers have spatial patterns that are particularly easy to detect, suggesting these cancers can be canonical types for image analysis. Patterns for TP53 mutations can also be detected, with WSI self- and cross-tissue AUCs ranging from 0.65-0.80. Finally, we comparatively evaluate CNNs on 170 breast and colon cancer images with pathologist-annotated nuclei, finding that both cellular and intercellular regions contribute to CNN accuracy. These results demonstrate the power of CNNs not only for histopathological classification, but also for cross-comparisons to reveal conserved spatial behaviors across tumors

The Jackson Laboratory: The Mouseion at the JAXlibrary

Deep learning-based cross-classifications reveal conserved spatial behaviors within tumor histological images

Author: Caruana Dennis
Chuang Jeffrey H.
Farahmand Saman
Foroughi Pour Ali
Namburi Sandeep
Noorbakhsh Javad
Rimm David
Soltanieh-Ha Mohammad
Zarringhalam Kourosh
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/12/2020
Field of study

Boston University Institutional Repository (OpenBU)

The Jackson Laboratory: The Mouseion at the JAXlibrary

Optimal Bayesian Feature Selection: A New Approach for Biomarker Discovery

Author: Foroughi pour Ali
Publication venue: The Ohio State University / OhioLINK
Publication date: 25/09/2019
Field of study

OhioLINK Electronic Thesis and Dissertation Center

Heuristic algorithms for feature selection under Bayesian models with block-diagonal covariance structure

Author: Ali Foroughi pour
Lori A. Dalton
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2018
Field of study

Abstract Background Many bioinformatics studies aim to identify markers, or features, that can be used to discriminate between distinct groups. In problems where strong individual markers are not available, or where interactions between gene products are of primary interest, it may be necessary to consider combinations of features as a marker family. To this end, recent work proposes a hierarchical Bayesian framework for feature selection that places a prior on the set of features we wish to select and on the label-conditioned feature distribution. While an analytical posterior under Gaussian models with block covariance structures is available, the optimal feature selection algorithm for this model remains intractable since it requires evaluating the posterior over the space of all possible covariance block structures and feature-block assignments. To address this computational barrier, in prior work we proposed a simple suboptimal algorithm, 2MNC-Robust, with robust performance across the space of block structures. Here, we present three new heuristic feature selection algorithms. Results The proposed algorithms outperform 2MNC-Robust and many other popular feature selection algorithms on synthetic data. In addition, enrichment analysis on real breast cancer, colon cancer, and Leukemia data indicates they also output many of the genes and pathways linked to the cancers under study. Conclusions Bayesian feature selection is a promising framework for small-sample high-dimensional data, in particular biomarker discovery applications. When applied to cancer data these algorithms outputted many genes already shown to be involved in cancer as well as potentially new biomarkers. Furthermore, one of the proposed algorithms, SPM, outputs blocks of heavily correlated genes, particularly useful for studying gene interactions and gene networks

Directory of Open Access Journals

Additional file 1 of Heuristic algorithms for feature selection under Bayesian models with block-diagonal covariance structure

Author: Ali Foroughi pour (4991417)
Lori Dalton (4991420)
Publication venue
Publication date: 01/03/2018
Field of study

Supplementary. Additional detail on the synthetic simulations and a comparison of classification error of the selected features of each algorithm is provided. The supplementary contains the top 100 selected genes and top 20 enriched pathways of each of the proposed algorithms as well as limma. (PDF 507 kb

Crossref

Directory of Open Access Journals

FigShare

Heuristic algorithms for feature selection under Bayesian models with block-diagonal covariance structure

Author: A Foroughi pour
A Foroughi pour
A Foroughi pour
A Foroughi pour
A Kohlmann
A Kühnl
Ali Foroughi pour
EP Diamandis
H Mi
H Mi
J Beirlant
J Hua
JJ Smith
KP Murphy
LA Dalton
Lori A. Dalton
N Ramachandran
N Rifai
R Edgar
SE Ilyin
SN Goodman
T Haferlach
TJ Freeman
U Braga-Neto
Y Pawitan
Z Feng
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref